Transient map method in stop consonant discrimination
نویسندگان
چکیده
Discrimination between the voiceless stop consonants !k,p,tl is a subproblern in phoneme-based speech recognition systems. Lack of energy during the pronunciation and the fast transient effects at the end of the phoneme make the recognition difficult. A method of so called Phonotopic Maps [2] was studied in order to develop simple and effective solutions for discrimination. In the following studies the method of 'fransient Maps, a derivative of Phonotopic Maps, was found to be an easy-to-implement and powerful algorithm for real-time speech recognition systems. It contains an automatic learning algorithm that tunes the discrimination elements to detect the differences between the spectra at the end of the stop consonant. Using Transient Maps it is possible to classify correctly 80 to 90 percent of all voiceless stop consonants in our speech recognition system. Thus the recognition accuracy of voiceless stop consonants is comparable to that of the other phonemes.
منابع مشابه
Application of Local Binary Patterns for SVM based Stop Consonant Detection
Detection of acoustic phonetic landmarks is useful for a variety of speech processing applications such as automatic speech recognition.The majority of existing methods use Melfrequency Cepstral Coefficients (MFCCs) describing the short time power spectral envelope of the speech signal. This paper hypothesizes that a different feature extraction method can be used to complement MFCCs by capturi...
متن کاملConsonant Class Discrimination in Dysarthric Speech Based on Support Vector Machine Using Class- Dependent Acoustic Parameters
In this paper, we propose a consonant class discrimination (CCD) method in dysarthric speech, where a support vector machine (SVM) is employed by using class-dependent acoustic parameters. To this end, each consonant is categorized into one of five classes according to the manner of articulation such as stop, affricate, fricative, nasal and glide. In the proposed CCD method using SVM, acoustic ...
متن کاملConsonant discrimination in elicited and spontaneous speech: a case for signal-adaptive front ends in ASR
The constant frame length in typical ASR front ends is too long to capture transient phenomena in speech, such as stop bursts. However, current HMM systems have consistently outperformed systems based solely on non-uniform units. This work investigates an approach to “add back” such transient information to a speech recognizer, without losing the robustness of the standard acoustic models. We d...
متن کاملConsonant burst enhancement: a possible means to improve intelligibility for the hard of hearing.
The possibility of using a circuit to amplify selectively the burst of a stop consonant is investigated. It is shown that such a circuit used in the speech channel of an amplifying system can improve the discrimination between stop consonants. Such a system could be of value in assisting the hard of hearing.
متن کاملTemporal encoding of the voice onset time phonetic parameter by field potentials recorded directly from human auditory cortex.
Voice onset time (VOT) is an important parameter of speech that denotes the time interval between consonant onset and the onset of low-frequency periodicity generated by rhythmic vocal cord vibration. Voiced stop consonants (/b/, /g/, and /d/) in syllable initial position are characterized by short VOTs, whereas unvoiced stop consonants (/p/, /k/, and t/) contain prolonged VOTs. As the VOT is i...
متن کامل